A Pitch-Asynchronous Simple Method for Speech Synthesis by Diphone Concatenation using the Deterministic plus Stochastic Model

نویسندگان

  • Daniel Erro
  • Asunción Moreno
چکیده

One of the most common approaches to speech synthesis is the concatenation of diphones, extracted from a previously recorded database. The prosodic parameters of the recorded speech fragments have to be adapted to the specifications of the new utterances to be synthesized. In this paper, the deterministic plus stochastic model of speech is used to modify and smoothly concatenate the analyzed diphones. A very high quality is reached without pitch-synchronism, and complex calculations like the vocal tract estimation are avoided. Instead, simple linear interpolations and fast calculations are performed, and only harmonically related sinusoids are taken into account. The resynthesis of the concatenated data is carried out by the overlap-add method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Speech Synthesis System using the Deterministic plus Stochastic Model

In this paper, a high-quality concatenative synthesis system using the deterministic plus stochastic model of speech is described, in which the prosodic modifications are performed by means of very simple and efficient operations, as we reported in a previous work [11]. In particular, pitchsynchrony is not necessary, and linear interpolations substitute other types of estimation. The method for...

متن کامل

Diphone concatenation using a harmonic plus noise model of speech

In this paper we present a high-quality text-to-speech system using diphones. The system is based on a Harmonic plus Noise (HNM) representation of the speech signal. HNM is a pitch-synchronous analysis-synthesis system but does not require pitch marks to be determined as necessary in PSOLA-based methods. HNM assumes the speech signal to be composed of a periodic part and a stochastic part. As a...

متن کامل

A biphone constrained concatenation method for diphone synthesis

Diphone concatenation [1] has the advantages of simplicity and a relatively small database of speech when compared to other concatenative synthesis methods (e.g., [2]). However, diphone concatenation faces two notable problems. The first is coarticulation which extends beyond the scope of a single diphone and entails some degree of contextual mismatch for virtually any diphone in at least some ...

متن کامل

Pitch Contours as Predictors of Audible Concatenation Artifacts

This paper deals with the traditional problem of the occurrence of audible discontinuities at concatenation points at diphone boundaries in the concatenative speech synthesis. While most of the related studies put stress on the spectral component, we focused on the pitch contours and their role as predictors of the discontinuities. To measure the amount of information contained in the pitch con...

متن کامل

Synthesis and Control of Synthesis Using a Generalized Diphone Method

Generalized Diphone Control is a powerful means of building a musical phrase from dictionaries of analysed sound units by building sequences of units and concatenating and articulating them. ~rough a graphical user interface on Macintosh, the Diphone 2.0 software provides analysis, control and synthesis according to various models, such as the Sinusoidal Additive model and the Chant model. A la...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005